Fundamental Uncertainty as a Consequence of Locality 
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In this paper we introduce an operational physical locality principle, branch locality, similar to 
non-signaling but applicable to single systems. Branch locality restricts the dynamics that can 
be implemented in a spatial interferometer. We give a simple proof that any theory with state- 
changing dynamics either: (i) respects a particular uncertainty relation, or (ii) violates branch 
locality. Quantum theory and classical theory are in category (i), whereas category (ii) includes 
the theory known as box world. Box world is typically presented as reasonable because it satisfies 
the non-signaling condition, but here we demonstrate that if it is allowed non-trivial dynamics, 
it must violate a locality principle that any physical theory should respect. Our results hence 
present a compelling argument for the necessity of the uncertainty relation in quantum and post- 
quantum theories. 



Introduction. — The Uncertainty principle, stipulat- 
ing that certain observables cannot be specified simul- 
taneously, is considered the cornerstone of quantum 
physics. It was emphasized by Heisenberg to be a de- 
parture point of quantum physics from the classical view 
which maintains that all properties of physical systems 
can be known at the same time [l|. Bohr used the 
quantum uncertainty to develop a broader view known 
as "complementarity" which, he emphasized, existed be- 
tween different physical measurement scenarios p|. The 
fact that quantum physics implies that a particle's po- 
sition and momentum cannot be known simultaneously 
has been considered counterintuitive and there have been 
many unsuccessful attempts to go beyond it. 

Here we show that uncertainty itself is required by a 
very simple principle we call "branch locality", which 
stipulates that any action in a spatial region having zero 
probability of containing a system cannot have any effect 
on the state of that system. 

Our arguably simple proof makes use of the well- 
established convex framework for probabilistic theo- 
ries , which has been used to show interesting results 
in the fields of information theory Q , statistical mechan- 
ics 0] and axioms for quantum theory 0, 0, H| ■ As an op- 
erational framework, essentially any experiments yielding 
tables of data can be described in this way [!, H[ . A con- 
sequence of our results is that computation via spatial 
interferometry appears useless in any post-quantum the- 
ory without the uncertainty relation. 

Branch locality is similar in spirit to a commonly used 
locality principle known as non- signalling: that acting 
on one system does not have any immediate observable 
impact on another system at a different (space-like) lo- 
cation. Quantum theory respects non-signalling as the 
state of a subsystem is invariant under local operations 
on the other subsystems. (Violating non-signalling is 
distinct from Bell-violation, which quantum theory, of 
course, does exhibit). It seems very unlikely that any 
post-quantum theory will violate non-signalling. 

Branch locality (BL), in contrast to non-signalling, is 



applicable to single systems. We consider a system which 
may be found in certain spatial regions with some prob- 
ability. We demand that if the system has no probability 
of being found in a particular region, then actions on that 
region cannot have an observable impact on the system. 
We call this innocent-sounding principle branch locality, 
because we are envisaging different branches of an inter- 
ferometer, see Figure [TJ 
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FIG. 1: The Mach-Zehnder interferometer. Branch local- 
ity states that if the (possibly post-quantum) photon is with 
probability 1 to be found in one of the branches of the in- 
terferometer, then operations on the other spatially disjoint 
branch cannot change the (operational) state of the photon. 

In a standard quantum interferometer experiment one 
can change the state, and thus the statistics of some mea- 
surements, by local operations on the branches (e.g. by 
introducing an optical fibre detour in a branch). These 
operations respect BL. In contrast, we find that if a the- 
ory violates a particular instance of the quantum uncer- 
tainty relation, then all state transforms violate BL. In 
other words, a theory for such experiments must either 
be trivial or respect that uncertainty relation. 

For alternative perspectives on interference in post- 
quantum theories see for example [ToL fllj . 

Modelling an interferometer experiment be- 
yond quantum theory. — In the case of quantum the- 
ory one may describe an ideal Mach-Zehnder interferom- 
eter (Figured]) using a single qubit, i.e. a 2-dimensional 
Hilbcrt space. The state after the first beam-splitter 
can be expressed in the which-branch basis as \tp) — 
Cupl-^up) + ciow|ziow)- The observable giving the expected 
position corresponds to z up \z up ) (z up \+ z iow \z iow ) (z iow \ for 
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some labels 2: U p, z\ ow that we assign to the respective 
branches. It will here be convenient to label these ±1 
respectively so that the observable is modelled by the 
Pauli-matrix Z = \z up )(z up \ — |^iow)(^low|- (The argu- 
ment will also work for other labellings than ±1). 

The state space of a qubit is the well-known Bloch 
sphere, see Figure^ Here a state is represented by a real- 
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FIG. 2: State space: the quantum state space is the (Bloch) 
sphere. We also consider the possibility of other states outside 
the sphere. The case of the maximal cubic state space is an 
instance of the so-called 'box world'. We use branch locality 
to show that the only states allowed on the faces of the cube 
are those right in the centre of each face (where the sphere 
touches the cube). 

numbered vector of expectation values: [(X), (Y), (Z)] T 
where X and Y are the other two Pauli-matrices, and 
(g) = p(g = +1) — p(g = —1). Mixtures of states cor- 
respond to probabilistic (convex) combinations of these 
states, lying inside the sphere of pure states defined by 

(X) 2 + (Y) 2 + (Z) 2 = 1. (1) 

The above equation constitutes an uncertainty relation; 
for example if (Z) = 1 one must have (X) = (Y) = 
0. The more familiar formulation in terms of standard 
deviations, that AXAZ > ±\{[X,Z])\ = \(Y)\, is implied 
by Eqffl (recall that (Ag) 2 = (g 2 ) - (g) 2 ). 

The real vector used above amounts to an operational 
description of the state. One may now entertain the pos- 
sibility of post-quantum states by associating them with 
points outside the sphere of pure quantum states. For 
example, the state (1, 1, 1) T is not allowed in quantum 
theory as it violates the uncertainty principle of EqJT] 
We shall here a priori allow such states and later rule 
them out. 

It will be crucial to our argument to consider transfor- 
mations of states. These must take all allowed states to 
allowed states. They must also respect probabilistic mix- 
tures: for a transform T acting on a mixture of two states, 
V\ and 2/2, we have T {p\V\ + piv-i) = p\T (vi)+P2T (z^). 
We show in the technical supplement, using established 



results on convex theories, that transforms are then real- 
numbered matrices, up to the subtlety that one should 
now add an extra component n to the state vector cor- 
responding to the 'normalisation' of the state (n = 1 for 
normalised states). 

We shall here consider states of the form 

v= [n,(Z), (X 1 ),(X 2 ),(X 3 )...] T , 

where we use the notation (g) = p(g = +1) — p(g = — 1) 
(if a state is subnormalised, in which case this takes val- 
ues in the range — n to n). We shall consistently take (Z) 
to be associated with position, such that Z = +1(— 1) is 
associated with the upper (lower) branch (we show in 
the technical supplement that the argument does not de- 
pend on this labelling). The number of measurements 
used to specify the state is restricted to three in quan- 
tum theory, with X\ and Xi the Pauli matrices X and 
Y respectively. However we shall allow for a larger or 
smaller number than that. 

A theory is specified by the set of states (which im- 
plicitly assumes a set of measurements and outcomes has 
been defined) and the allowed transforms. Apart from 
making general statements about theories, we shall re- 
fer to three concrete examples. The quantum qubit case 
has states represented as v = [n, (Z), {X), (Y)] T . The 
allowed transforms consist of both reversible SO (3) trans- 
forms as well as linear transforms shrinking the sphere. 
Secondly, we shall call the case of a diagonal density ma- 
trix (s.t. (X) = (Y) = 0) the classical case, modelled 
as v = [n, (Z), 0,0}. Here any matrix preserving or 
shrinking the line of states is allowed. Finally, the max- 
imal state space of all probabilistic mixtures of the cor- 
ners (n, ±ri, ±n, ...) is known as the state-space of a single 
system in box-world. The special case of (n, ±n, ±n) is 
termed a This can (for n=l) be visualised as the 

square X-Z plane slice of the cube in Figure [2] Gbits are 
currently of great interest in the context of understand- 
ing whether there can be Popescu-Rohrlich (PR) boxes. 
These are hypothetical maximally Bell-violating systems, 
see [f3] or the technical supplement. Roughly speaking, 
the gbits are to a PR-box what qubits are to a Bell state. 
More specifically, the pure states of a gbit, the corners 
of the maximal state space, are the conditional marginal 
states of a PR-box — see the technical supplement. Thus 
if a PR-box can exist then so can a gbit. In box-world 
the allowed transforms on single systems are taken to be 
any matrix that preserves or shrinks the state space (but 
our arguments will apply even if one is not so permissive 
with the tranforms). 

Branch Locality. — Branch locality, as described in 
FiglU can now be formalised as an operational principle. 

Definition 1 (Principle of branch locality). Physi- 
cal actions on one region of space have no immedi- 
ate effect on systems with no probability of being de- 
tected in that region. In particular, a system defi- 
nitely in the upper branch such that its state is z? up = 
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[n,(Z), (Xj), (X 2 )...f = [n,l, (Xi), (X 2 )...] T ls te/t in- 
variant by any transform on the lower branch: 

(similarly T up Pi ow = v\ ovl .) 

We shall also impose a more obvious condition on op- 
erations at different branches. We take the state to be 
described by someone without access to outcomes of any 
measurements performed on the local branches. Local 
transformations T acting on the respective branches must 
then not alter the statistics associated with the Z mea- 
surement, i.e. if the transform takes (Z) to (Z)' then 

(z) = (zy. (3) 

Example 1: Box world (gbit). — Using the notation 
defined above, if we take a state to be in the upper branch 
with certainty it must have the form z7 up = [n, n, (X)] T 
(recall that n is the state normalisation with n = 1 for a 
normalised state). 

Consider an operation on the lower branch. From the 
above considerations, Equation [5] and Equation [3] both 
hold. Recall moreover that the transform is a matrix. It 
follows from Equation [2] that 
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One can write down a similar expression for Equa- 
tion 21 Consider the ranges of the different variables for 
gbits: n can take values in the range to 1 and (X) as 
well as (Z) from -n to n. Note that even when (Z) = ±1, 
(X) is free to take any value in the range — n to n. Note 
also that T\ ow is independent of the state it acts on. It 
follows with a little work that 
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Similarly by considering (Z) = — 1 we have T up = 1. 

Thus no non-trivial operation satisfying branch local- 
ity can be applied on either branch. However, there are 
clearly transforms leaving Z invariant that are not of this 
trivial form, e.g. T* — diag(l, 1, — 1). This transform if 
allowed would provide non-trivial interference. However, 
we have shown that T* cannot be associated with a lo- 
cal operation on a branch! If we assume all box-world 
states can be realised in the interferometer, and any non- 
trivial transform can be performed, we must conclude 
that box world violates branch locality. 

Uncertainty principle. — The above proof for a gbit 
that all non-trivial dynamics violate branch locality does 
not carry through in quantum theory. The proof makes 
use of that fact that, for gbits, when (Z) = ±1, (X) is 
still free to take all possible values. This is a violation of 
the quantum uncertainty principle and a key difference 



between box world and quantum theory. We now argue 
that any convex framework theory violating such an un- 
certainty relation (defined below) has no state-changing 
dynamics respecting branch locality. 

We consider a set of maximally independent measure- 
ments Z and Xi . . .X n , all taking values -1 and 1. (We 
call a set of measurements {gt}i maximally independent 
if we can write v = X)i(ffi)^ anc ^ &i ' = 

Definition 2 (Uncertainty principle for maximally inde- 
pendent measurements). // \(Z)\ — 1 then (Xi) = OVi. 

The uncertainty principle is said to be totally violated 
if the set of states where \{Z)\ = 1 includes at least one 
state where (X\) ^ 0, one state where (X2) 7^ 0, etc. 

Theorem 1. In theories that satisfy branch locality, the 
transforms T on local branches (thus respecting Eq. 0) 
must have the form: 

T =1 

except when the uncertainty principle is upheld. 

The proof, which follows similar lines to the example 
above in a more complicated setting, is in the technical 
supplement. 

Example A: Qubit. — For pure states of a qubit (X) 2 + 
(Y) 2 + (Z) 2 = 1. Thus when (Z) = ±1 both (X) = 
and (Y) = 0. Thus, labelling the measurements X\= X 
and X2 — Y, we see that a qubit satisfies the uncertainty 
principle of Definition [2l Theorem [TJ does accordingly 
not imply that the transforms in this case have to be 
1. Instead, one can see that imposing BL, in particular 
Equations [3] and [U implies that 
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for both the upper and lower branches. This confirms 
what is well-known, namely that quantum theory can 
describe dynamics other than 1 in an interferometer. 

Example B: Box world. — Box world, even beyond the 
2-dimensional gbit case, violates the uncertainty princi- 
ple. Thus from Theorem [TJ the only operation that may 
be applied without violating branch locality is the trivial 
clement T = 1. In other words, all box- world transforms 
violate BL. Note that this statement does not only apply 
to box-world, but to any state space with more states on 
the faces of the outer cube than those right in the centres 
of the faces. 

Example C: Classical. — The classical case (also defined 
above) is a subspace within quantum theory, and thus 
surprisingly respects the uncertainty relation. Accord- 
ingly Theorem [TJ does not have a bearing on transforma- 
tions in this case. Nevertheless one sees the transforma- 
tions are trivial as there is only one degree of freedom, 
Z, and from Equation^ it is conserved. 
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Example D: Which branch measurement. — We now 
give an interesting example of a transform which should 
respect branch locality in any theory: a measurement of 
which branch the particle is in. It is not obvious a pri- 
ori that this should respect branch locality as it appears 
to be a multi-branch operation. Suppose however that 
one places a (non-destructive) detector in the lower path 
only. Physically this is an operation on the lower branch 
only, and the associated transform of the state should ac- 
cordingly leave states associated with the upper branch 
invariant, i.e. it should respect branch locality. More- 
over Eq. [3] must be respected. Accordingly, our results 
can be applied here to imply that the state transforma- 
tion associated with the detector has to be 1, unless the 
uncertainty relation is respected. This suggests a close 
link between our results and measurement disturbance 
(as a disturbing transform cannot equal 1). 

Consequences for computation. — As computers 
are physical objects, the laws of nature dictate the limits 
of computation. In particular, a computation has to be 
performed as a dynamical evolution of a physical system. 
Our results can be interpreted as saying that computa- 
tion using a Mach-Zehnder interferometer is trivial un- 
less the uncertainty relation holds. Whilst this is a very 
particular experimental setting, it stands out as being 
the original setting in which quantum computation was 
conceived. The Deutsch-Jozsa algorithm [13J arises nat- 
urally by considering what one can do with a quantum 
system in an interferometer. In [l4j | it was argued more 
generally that key quantum algorithms can be viewed 
as a three stage interferometer experiment: (i) prepare 
a superposition of different branches, (ii) apply differ- 
ent phase-shifts to different branches, and (hi) bring the 
branches together and make a measurement, yielding in- 
formation about the phase shifts that were done. The 
apparent ability to prepare and individually address sev- 
eral in put s in stages (i) and (ii) is called quantum paral- 
lelism [13] (distinct from classical parallel computation). 

Our result accordingly hints that an uncertainty rela- 
tion is required to achieve this parallelism, particularly if 
the results for spatial degrees of freedom can be applied 
to all degrees of freedom. It suggests that parallelism 
may not even be a sensible concept in a spatially dis- 
joint system without uncertainty. This directs the search 
for post-quantum theories with stronger computational 
power to those that respect the uncertainty relation, such 
as 'systems with limited information content' |15j . 

It may seem counterintuitive that the restriction of un- 
certainty leads to better computation, but this is because 
branch locality places a joint restriction on the states 
and transforms, such that the restriction on transforms 
is weakened by restricting the set of states. 

It is already known that box world has very poor dy- 
namics [f| [l6[ , but the branch locality principle has more 
dramatic implications than previous results. It rules 
out any non-trivial normalisation-preserving dynamics, 
whether reversible or not, and by non-trivial we mean 
any transform that is not the identity (whereas the word 



'trivial' in the title of [l6[ refers to the lack of correlating 
interactions). Moreover we showed the same restriction 
holds for any convex theory on bits not satisfying the 
above uncertainty relation. 

Relation to other principles. — The principle is, as 
already mentioned, very similar to non-signalling; the 
two principles seem to be instances of some wider notion 
of locality. More subtly they can also both be viewed as 
defining what it means to have an operation on a subsys- 
tem or, in the case of branch locality, an operation on a 
branch. It would seem impossible to talk about notions 
like systems localised in space without these principles. 
We think they are both equally fundamental and unlikely 
to be violated in a post-quantum theory. However, stan- 
dard non-signalling does not rule out box world, which 
has been designed to respect it. It is surprising that an- 
other locality principle does @. 

There are also many interesting information theoretic 
principles one may compare and relate branch-loc ality 
to. The one-bit principle, as borne out e.g. in @, Qjl fl8| . 
has a very similar mathematical implication, but with a 
radically different motivation. The principle, in one of 
its many similar formulations, states that an elementary 
system (a system with 2 perfectly distinguishable states) 
can only give a definite result in one specific measure- 
ment. The mathematical formulation of this in 18] is 
actually very close to what we here call the uncertainty 
principle for maximally independent measurements. Our 
argument accordingly supports the one-bit principle. 

Outlook. — Whilst the uncertainty principle we con- 
sider above is a very significant restriction, the full quan- 
tum uncertainty relation also restricts states where none 
of the measurements are deterministic. It would be inter- 
esting to investigate possible extensions of branch local- 
ity to include such cases. For example one would expect 
that states with little weight on one branch can only be 
slightly changed by operations on that branch. Alter- 
natively one may investigate combining branch locality 
with some other fundamental principle(s), as part of a 
program of exploring which convex theories are consis- 
tent with space-time. 

As there is a relation between the uncertai nty relation 
and the amount of Bell violation in a theory pjl [2(| , one 
may hope to link Tsirelson's upper bound on quantum 
correlations with branch-locality. 

We have refrained from claiming that branch locality 
applies to other degrees of freedom than position, but we 
anticipate that it is (this is actually why we call it branch 
locality rather than path locality which could otherwise 
have been more natural). 
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Technical supplement 

Introduction to the general probabilistic theory 
(GPT) framework 

We firstly introduce the key concepts of the opera- 
tionalist approach we will use: the framework for gen- 
eral probabilistic theories ( GPT) also known as the con- 
vex framework. For a more detailed description of the 
framework see e.g. @, Q- The framework is operational 
in the sense that essentially any experiment producing a 
data table can be described in this way 0, H j . For read- 
ers familiar with quantum theory it can also be helpful 
to think of the GPT framework as a generalisation of 
quantum theory. In quantum theory there is a system 
which is prepared in a state p, determined by the prepa- 
ration in question. There is a set of measurements one 
may do, each represented by a set of projection opera- 
tors (or more generally POVM elements). The 
operationally significant quantities, the probabilities of 
given outcomes are given by pi = Tr(pTli). Viewed more 
abstractly, the state is a vector p in the vector space of 
Hermitian operators. A projection operator is also such 
a vector, II say. In other words we may pick a basis of 
the vector space and write p = J2i an d n = J^j v j^j- 
(Examples of such bases are the Pauli operators and the 
pure state basis given by Hardy in Q). Note that the 
coefficients of these expansions are real, so this is termed 
a real vector space. Tr(pH) is then the Hilbert-Schmidt 
inner product (for Hermitian matrices) which we may 
write as (p, II) . 

If the basis elements are chosen so that they are orthog- 
onal with respect to the norm, and all having the same 
inner product c with themselves, we see that (p, II) — 
£ • vc where the right-hand-side is the standard Euclidean 
norm. Thus we may represent a quantum state, measure- 
ments on it, and the resulting probabilities in terms of 
real vectors and the Euclidean norm. 

In the framework we more generally represent the state 
of a system s as a real vector, and the measurement- 
outcome pairs, called 'effects' for historical reasons, as 
real vectors e (such a vector could for example in the 
quantum case be associated with X=+l, where X is the 
Pauli X) . The probability of the outcome associated with 
a given e is given by s ■ e. Part of specifying a given 
theory is specifying which states and effects are allowed. 
All convex combinations (mixtures) of allowed states are 
always allowed, written as s = ^2iPisl (hence the name 
'convex framework'). A state is said to be pure if it is 
not a non-trivial mixture of other states, otherwise it is 
called mixed. 

Transformations are represented as real matrices act- 
ing on the state vector (following from the requirement 
of respecting mixtures, see e.g. Q). They must take all 
allowed states to allowed states, but there may be further 
restrictions specified for a given theory in the framework. 
A transformation T is termed reversible if its inverse T _1 



is also an allowed operation. 

As well as quantum theory and theories contained 
therein (such as classical probability theory), one may 
also formulate a theory called box world in this way. 
Box world contains all states that do not violate non- 
signalling (that the reduced state of one system is invari- 
ant under operations on another) Q . The standard ver- 
sion of box world assumes there are only two binary out- 
come measurements under considerations. We label these 
X and Z and the outcomes ±1 in analogy with quantum 
theory. A normalised state can then, as discussed below, 
be represented as v = [(X) (Z)] T and is any mixture of 
the four extremal states [±1 ± 1] T . The most general 
box worlds are m-in n-out box worlds, which mean that 
one selects one measurement setting from m possible set- 
tings and obtain n-valued outcomes. In particular, the 
3-in 2-out box world is the most analogous to a qubit in 
quantum theory. 

We shall assume throughout that if a given v is allowed, 
the same state with one or more of the entries replaced 
by is also allowed (an entry being corresponds to that 
observable being completely unknown). 



Expectation value representation 

We shall here find it makes the notation very simple to 
focus on the expectation values of measurements, along 
the lines of the above example for box- world. In what 
follows we define a representation of states in terms of 
expectation values and relate it to the more standard 
representation in terms of probabilities, including show- 
ing that the transforms are matrices also in the new rep- 
resentation. 

Consider the case of states described in terms of two 
fiducial measurements with binary outcomes (measure- 
ments are called fiducial if their statistics are sufficient 
to determine the state). We shall here only consider mea- 
surements with binary outcomes (because we shall be in- 
terested in interferometers with two branches), but we 
will later generalise to many such measurements. Such 
outcomes need not be normalised, but we require the sum 
of probabilities of both measurements to be equal. 

Consider the probability representation of a state @: 



p(X = 
p(X = 



p(Z = 
p(Z = 



T) 
-1) 



The following is the alternative (normalisation-including) 
expectation value representation: 







n 




n 


v := 


p(X = 


+l)-p(X = -l) 




(X) 




p(Z = 


+l)-p(Z=-l) 




(Z) 



where the normalisation n = 1 if the state is normalised 
and n < 1 if it is subnormalised. (For a subnormalised 
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state we still use the notation (g) := p(g = +1) — p(g = 
— 1), implying that — n < (g) < n) 

If a transform T acts as a matrix on the state-vector 
in the probability representation, as it should if it re- 
spects mixtures, is it also a matrix in the expectation 
value picture? Suppose for the sake of argument that (i) 
There exists a fixed matrix M such that V = Ms for all 
states, (ii) The effective inverse matrix M~ x also exists 
satisfying M~ 1 Ms — s Vs . Then we can write 

MTs = MTM~ 1 Ms = MTM~ 1 v = v' 



where v' is the expectation representation state after the 
transform. Thus we see that if the two assumptions above 
hold then the state transforms by a matrix also in the ex- 
pectation value picture. Moreover these two assumptions 
do hold here, with 



labelled A and B here. The state can be represented as 



M= 



1/2 1/2 1/2 1/2 
1-10 
1-1 



, M~ 1 = 



1/2 1/2 

1/2 -1/2 

1/2 1/2 

1/2 -1/2 



The above argument naturally generalises to more mea- 
surements. 

Not also that one could have used a different label for 
the positions in the probability picture, i.e. not ±1 and 
then mapped that into the expectation value picture us- 
ing the same matrix as above. In this sense our argument 
does not depend on how one labels the two positions. 



p(X A = 


+i,x B = 


+1)" 




- 1 

2 


p(X A = 


+i,x B = 


-1) 







p(X A = 


-i,x B = 


+1) 







p(x A = 


-i,x B = 


-1) 




1 

2 


p(X A = 


--1,Z B = - 


H) 




1 

n 


p(X A = 


+1,Z b = 


-1) 







p(X A = 


-i,z B = 


+1) 







p(X A = 


-i,z B = 


-1) 




_L 

2 


p(Z A = 


+i,x B = 


+1) 


— 


1 

2 


p(z A = 


+i,x B = 


-1) 







p{z A = 


-i,x B = 


+1) 







p(z A = 


-i,x B = 


-1) 




i 


p(z A = 


+1,Z B = 


+1) 







p(z A = 


+i,z B = 


-1) 




1 


p{Z A = 


-i,z B = 


+1) 




1 

2 


p(z A = 


-i,z B = 


-1) 




. 



Now consider for example the case where a measure- 
ment of X A is performed, and one finds X A = +1. Then 
we see the state of B, called the conditional marginal 
state, must be 



SB 



P(X B = 
P(X B = 



+1) 
-1) 



p(z B = 
p(z B = 



+1) 

-1) 





- 1 - 









1 




.0. 



i.e. the conditional marginal state is one of the four pure 
states of the gbit state space. One can see by considering 
X A = — 1, Z A = ±1 that the other three pure states can 
be similarly prepared. Thus under the assumption that 
a PR-box is allowed, and that mixtures of all states are 
allowed, then the most general gbit-state is allowed. 



B. Proof of main theorem 



A. PR-boxes and gbits 

We now briefly describe the connection between PR- 
boxes and gbits. This subsection is included for com- 
pleteness and does not contain new results, see in partic- 
ular [6] for essentially the same statements. 

In the probability representation a general gbit state 
is a mixture of four pure states: 





- 1 " 




- 1 " 




- o- 




- 0" 














1 




1 


Sgbit := Pi 


1 


• +P2 





• + P3 


T 


■ +Pi 


T 









1 









l 



Thus under the assumptions that (i) the four pure 
states are allowed and (ii) any probabilistic mixtures of 
states are allowed, then all such states are allowed. 

Now consider a PR-box. This involves two systems, 



Theorem. In theories that satisfy branch locality, the 
transforms T on local branches must have the form: 

T =1 

except when the uncertainty principle is upheld. 

Proof. For T to leave the statistics of Z invariant, it must 
satisfy, for all (Z) and n 



r uq n\ 

Z r Z Zl 

a^or £oo ^01 



n„ 





n 

(Z) 
(Xi) 




n 

(Z) 

(XiY 




- (Xn) - 




.(x n y. 



•Enr *£n0 -^nl ■ • • ■ 

This gives a set of simultaneous equations: 

rn + n (Z) +ni(Xi) + ... + n n (X n ) = n, (4) 
z I n + z (Z)+z 1 (X 1 ) + ... + z n (X n ) = (Z), (5) 
x ir n + x i0 (Z) + x l i(Xi) + . . . + x in (X n ) = (X,-)'.(6) 
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These must hold for all allowed n, (Z) and 
(Xi) . . . (X n ). As is always a possible expectation value 
(we can always choose to forget the outcome of a mea- 
surement), we can set (Z) = and (Xi) = OVi, and 
so Equation 2] shows that r = 1. We can choose non- 
zero (Z) and see from Equation [5] that Zq = 1, and from 
Equation |H that no = 0. We can also choose to set all but 
one (Z), (X^ = 0, so that Equation [5] implies that each 
Zi = for i = 1 . . . n, and Equation U that each n, = 
for i = . . . n, giving us the form of the top two rows of 
the transform. 

Moreover, for T to respect branch locality, taking Z to 
be the position measurement, we can set (Z) — n and 
(Xi)' = (Xi) to give the set of equations: 

x„n + x M n + xn(Xi) + . . . + x in (X n ) = (Xi). (7) 



Again, we can choose to set all (Xi) = 0, and so Equa- 
tion [7] will reduce to x„- + x iQ = OVi. 

When uncertainty is totally violated we are free to 
set all (Xi) = except one (Xj) ^ 0, and from Equa- 
tion [7] we see that Xji = Sji where Sji is the Kronecker 
delta. Furthermore, normalisation constraints will re- 
quire Xi r ,Xio = 0. Thus, for total violation of the uncer- 
tainty principle, the only operation allowed is T — 1. 

The case where the uncertainty principle is partially 
violated (i.e. there exists at least one non-zero value of 
(Xi) for some i), then it can also be shown that that the 
sub-matrix linking measurements without an uncertainty 
constraint is the identity. The proof is nearly identical 
to the case for total violation and so is omitted. 

□ 



